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Abstract 

We analyze the controllability of a two-layer network, where driver nodes can be chosen 
randomly only from one layer. Each layer contains a scale-free network with directed links, 
and the node dynamics depends on the incoming links from other nodes. We combine the in¬ 
degree and out-degree values to assign an importance value w to each node, and distinguish 
between peripheral nodes with low w and central nodes with high w. Based on numerical 
simulations, we find that, the controllable part of the network is larger when choosing low w 
nodes to connect the two layers. The control is as efficient when peripheral nodes are driver 
nodes as it is for the case of more central nodes. However, if we assume a cost to utilize nodes 
which is proportional to their overall degree, utilizing peripheral nodes to connect the two 
layers or to act as driver nodes is not only the most cost-efficient solution, it is also the one 
that performs best in controlling the two-layer network among the different interconnecting 
strategies we have tested. 


1 Introduction 

How can we efficiently control the dynamics on complex multilayer networks if we are able to 
control only a few nodes? This problem is of importance in the growing field of interconnected 
networks mm- Such networks consist of multiple layers that each contain a complex network, 
and additional links between nodes of different layers. In addition to its structural properties, 
such as degree, each node is characterized by a dynamical variable Xi{t) that changes dependent 
on the interaction with other nodes. Hence, we face a combined problem in which the dynamics of 
N coupled equations, where N is the total number of nodes, is exacerbated by the rather complex 
coupling between these nodes both through intra-layer and inter-layer links. The question then 
is (a) how many, and (b) which of these nodes we need to control in order to control most of the 
whole network. 

According to control theory, controllability characterizes the ability to drive a dynamical sys¬ 
tem from any initial state to any desired final state in finite time, by attaching control signals 
to a carefully chosen set of driver components. In the context of complex networks, fairly re¬ 
cently Liu et al. developed an analytical framework to study the controllability of single-layer 
complex networks. This assumes a linear dynamics for the nodes. Recent efforts have aimed at 
understanding the interplay between the topological structure and the controllability of complex 
networks However, a full control of large-scale complex networks has hardly been achieved. 
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In addition to the sheer size of such systems, there are constraints in accessing all of the driver 
nodes necessary to control the system |5]. This limitation motivates our research, namely to un¬ 
derstand how this control can be achieved with a rather small number of driver nodes. We further 
want extent the scope from single-layer to multilayer networks and consider, as an additional 
challenge, restricted access to only one layer of a multilayer network. 

This merges two research lines, namely controllability of complex networks and multilayer net¬ 
works, which were jointly discussed so far in a few publications only iHH]. All these works, 
however, focus on the controllability of the whole system and there is no restriction when choos¬ 
ing driver nodes, which is a main issue addressed by our paper. More precisely. Yuan et al.|6] 
deployed the exact controllability theory to study controllability of multiplex networks; Nie et 
al.[7] analyzed the impact of degree correlation on controllability; and Menichetti et al.|8j ad¬ 
dressed the robustness and stability of control configuration. 

We can already build on a number of works that address the role of interconnecting links between 
different network layers |9HI3. It was shown recently that structural network properties can 
change even in an abrupt way |13) . Furthermore, interconnecting links can significantly affect 
the way dynamic processes evolve in multilayer networks 

However, even in the simplest cases, knowing the dynamics on a multilayer network does not 
mean that we also can control it, i.e. steer the dynamics toward a desired final state. This 
problem can be re-casted as a design problem: Given a network with two layers, how can we 
connect these layers with a limited number of inter-layer links such that the whole network can 
be controlled by using driver nodes from only one layer? If not all the nodes in the network 
can be controlled, what is the size of the controllable subnetwork that can be controlled by a 
fixed number of driver nodes. In our work, which can be considered as a proof-of-concept, using 
extensive computer simulations to test different driver node selection criteria and four distinct 
network interconnecting strategies, we demonstrate that a) the whole two-layer network can 
be controlled by driver nodes from just one layer, b) to maximize the controllable subnetwork, 
peripheral nodes should be used to connect the two layers c) choosing peripheral nodes to control 
the network can be as efficient as choosing central nodes. 


2 Model description 

We consider a two-layer network G(y,E) with the number of nodes N = \ V\ = Nq Ni, where 
layer 0 contains Nq and layer 1 A^i nodes. The links in each layer are directed, i.e. each node i 
has an in-degree kf^ and an out-degree Building on Ref |18| . both are drawn independently 
from a power-law degree distribution P{k) oc k~'^ with /Cmin = 1 and 7 = 2, using the uncor¬ 
related configuration model (Results for different power-law exponent are shown in supporting 
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information) |19) . We can combine these degree valnes to assign an importance value Wi to each 
node 

w, = {kTT ( 1 ) 

Here a is a free parameter ranging from 0 to 1. As a increases, more importance is attributed to 
the in-degree in the calculation of w. We refer to central nodes as nodes with a high importance 
value w. 

Layers 0 and 1 are connected by L additional bidirectional interlayer links (We also test the case 
for interlayer links of randomly assigned direction, as shown in supporting information). Only 
one interlayer link per node is allowed at maximum, with q = L/Nq as the fraction of interlayer 
links. 

Our main assumption is that we can only access the A^o nodes on layer 0, in order to control the 
N nodes in the whole network. If we can control Nc < Nq nodes of layer 0, what is the number 
Nh < N oi nodes in the whole network that we can control, directly or indirectly, by means of 
Nc'l Ideally, we want to choose Nc as small as possible, while Ni, reaches values close to N. 

The choice of this ideal Nc of course also depends on the strategy by which the two layers are 
coupled using the q interlayer links. Hence, our main research question is how to couple these 
two layers in order to maximize N^,. Given the scale-free degree distribution, for each layer we 
can distinguish between hubs, i.e. nodes with a high importance value Wi, and nodes with low Wi 
that are only loosely integrated in the layer network. We refer to the latter as periphery. 

For the connection of the two layers, we can now think of four different strategies |2^, shown 
in Fig. (CC) nodes with high importance value w in layer 0 are connected to nodes with high 
importance value w in layer 1, (CP) nodes with high w in layer 0 are connected to peripheral 
nodes with low w in layer 1, (PC) peripheral nodes in layer 0 are connected to nodes with high 
w in layer 1, and (PP) peripheral nodes in layer 0 are connected to peripheral nodes in layer 1. 

The strategy to couple the two layers now consists of two steps: (i) calculate Wi and rank the 
nodes with respect to their Wi, for each of the two layers separately, (ii) until q is reached, 
deterministically choose nodes according to their rank on each of the two layers. I.e. dependent 
on the different strategies, we link high or low ranked nodes from the two layers until L interlayer 
links are formed. 


3 Structural Controllability 

In order to apply the framework of structural controllability |3|, we need to make assumptions 
about the dynamics that change intrinsic properties of the nodes. Let us assume that each node 
is characterized by a variable Xi{t). As shown in Figure [^, some of these nodes can be influenced 
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Figure 1: (Color online) Illustration of the coupling strategies for two layer networks, (a) The 
main scenario where we aim to control the system of networks using only driver nodes from 
layer 0. (b) The PP interconnecting strategy, (c) The CP interconnecting strategy, (d) The CC 
interconnecting strategy. Nodes colored in red are coupled by interlayer links denoted by red 
dashed lines. In the illustration, the two layer networks are interconnected by L = 3 links. 


by external signals Uk{t), which shall later be used to control the dynamics of the whole network. 
Let U(t) G be the vector of control signals. Then the matrix B G defines which nodes 

are directly controlled by the external signals Uk{t) {k = 1 , ..,Nc), with the element bij 7 ^ 0 if 
signal j is attatched to node i. 

The framework of structural controllability requires us to choose a linear dynamics for Xi{t), 
which reads in vector notation X(t) = {xi{t),X 2 {t), ...,XN{t)} with X G 

X(t) = AX(t)+ BU(t), (2) 

A G is the interaction matrix with elements aij (i,j = 1, ..., N) that describe the weighed 

influence between any two nodes either within or across layers in the multilayer network. Ac¬ 
cording to the Kalman rank condition [21], the dynamical system defined by (|^ is controllable, 
i.e. it can be driven from an initial state to any desired state, if and only if the controllability 
matrix C = [B, AB, A^B,..., A^~^B] G j-ank, i.e., rank(C)=A. 

In some cases, the exact value of the nonzero elements in A and B is not available, and the precise 
computation of rank(C) is therefore unattainable. For those cases, the weaker requirement of 
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structural controllability |22| was introduced. It treats A and B as structured matrixes, i.e. their 
elements are either fixed zeros or free parameters. 

The system is structurally controllable if the maximum rank of C, denoted as rankg{C), can 
reach N as a function of the free parameters in A and B. Based on |22) . Liu et al. [3] derived the 
minimum input theorem to identify the minimum number of driver nodes needed to control 
the whole network of N nodes. 

In real situations, some driver nodes necessary for control may not be accessible or the number 
of driver nodes Nu may be too large to be efficiently influenced by the limited number of control 
signals. Hence full control of the network cannot be achieved. In those scenarios, we are interested 
in the size of the subnetwork, given by Ni, = rankg{C) < N, that can still be controlled by a 
given set of driver nodes, Nc < N^- 

If there is only one driver node i denoted by B, the value of Ni, defines the control centrality of 
i |23[ 124] . For more than one driver node, there is an overlap in the subspace controlled by each 
node and the sum of the control centralities of all driver nodes may overestimate Nf,. 

Despite that analytical treatment using tools from statistical physics can be found in literature 
for the case of when the controllability of the whole system is considered, to the best of 
our knowledge, there is no analytical method to predict Nf, when we focus on the controllable 
subsystem. Even for the simplest case where only a single driver node is considered, and for very 
particular topologies, such as a directed acyclic graph in which a unique hierarchical structure 
can be identified, can the controllable space size of one node be predicted by its hierarchical level. 
Therefore, for a general case with more than one driver node, in order to effectively determine 
Nb, we deploy a linear programming approach |25j . 

The algorithm works by constructing an auxiliary network that is larger than the original one, to 
identify the cycle partition structure |25] that contains the maximum number Nb of controllable 
nodes in the complex network. We first construct an initial auxiliary network H{E,V) that 
contains Nh = \ V\ = N + Nc nodes. Nc is the number of control signals, which are represented 
by an additional set Sc of nodes in the auxiliary network. Regarding its topology, H preserves 
all the links defined in the matrix A, but has additional links from the set Sc to the driver nodes 
Nc (one link per driver node, indicated in Fig. [^by the zig-zag arrows). Next, we identify the 
reachable network H' which is given by those nodes that can be reached via a directed path from 
the set Sc- Then, we change the topology of H' by adding directed links of weight zero from any 
node within H' (excluding the set Sc) to all nodes in the set Sc- Also, we add self-loops of weight 
zero to all the nodes in H', to arrive at the auxiliary network H'{E, V) 

From H', we can now calculate Nb as the optimal value of the integer linear problem 

max E., ^Wehe (3) 
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Figure 2: (Color online) Uh as a function of the fraction of interlayer links and the number 
of driver nodes. The top panel shows the characteristics of as a function of the fraction of 
interlayer links q. In (a) a = 0.5, in (b) a = 0.2.The results are produced with 30 driver 

nodes. The bottom panel shows the characteristics of Ub as a function of the number of driver 
nodes Nc, for q = 0.4. In (c) a = 0.5, in (d) a = 0.2. Change of the number of driver nodes 
to other values smaller than doesn’t alter our hndings. All the results are produced with 
Nq = Ni = 2000. The data points are obtained over 100 simulations, with error bars represent¬ 
ing standard deviation. 

where We denotes the weight of link e, and he G {0,1} is a binary variable indicating whether 
one link is chosen to be part of the optimal solution of the linear programming problem. The 
subjections 

^ he = 1; ^ he = 1 Vn G 14 (4) 

e leaves v e enters v 

guarantee that the optimal solution to Eq. (|^ forms a cycle partition that spans the graph ff'. 

4 Results and discussions 

We now apply the above optimization method to treat the multilayer networks that were con¬ 
structed by the four different coupling strategies. To obtain the results, we use an ensemble 
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approach, i.e. we keep the configuration of each layer constant, but generate 5 multilayer net¬ 
work realizations with Nq = A^i = 2000 for each possible parameter configuration and for each 
coupling strategy. The importance parameter a and the fraction of interlayer links q are both 
varied between 0 and 1 in steps of 0.05. Eventually, for each configuration of parameters and 
each coupling strategy, we randomly sample 100 sets of driver nodes from layer 0, i.e. 20 per 
multilayer network realization. This results in 1.6 x 10® different network configurations in total. 

Our main interest is in the relative size of the network that can be controlled this way, i.e. we 
calculate nf, = /N for each parameter configuration and coupling strategy, rih = 1 indicates 
that for this configuration the whole system can be controlled. Our results are presented in two 
different ways: In the top panel of Fig. we compare Ub dependent on the fraction of interlayer 
links q, with the number of driver nodes Nc kept constant, while in the bottom panel of Fig. 

Ub is shown dependent on the number of driver nodes Nc, with q kept constant. The results are 
presented for two values of the importance parameter a, but results for varying a are shown in 
Fig.jH 

From the top panel of Fig. we report two observations: (i) For the PP strategy, a sizable control 
of the multilayer network (i.e. Ub reaches values close to 1) can be reached already for a fraction 
of interlayer links q below 1, i.e. not every node in the two layers need to be linked. E.g., for 
q = 0.5, nb already ranges between 0.8 and 1. (ii) Among the four different strategies to couple 
the two layers, the PP strategy fares best. I.e. with respect to control, coupling peripheral nodes 
is more beneficial than coupling nodes with high importance value. The results are similar for 
the two a values chosen. 

From the bottom panel of Fig. we observe again two interesting findings for this particular 
system: (i) the range of the controllable network, Ub, does not strongly vary if we increase the 
number of driver nodes Nc from one to a value much smaller than the system size, such as 200 
as shown in the figure. Ub never reaches 1, given the small values of Nc <C 2000, but already 
reaches remarkable values between 0.7 and 0.95 (Results for larger values of Nc are reported in 
the supporting figure Fig.S3(a)). (ii) Again, the PP strategy to link peripheral nodes in both 
layers allows of a considerably better control of the network. This distinction becomes most 
pronounced for a = 0.5. 

The superiority of the PP strategy in connecting the two layers is further demonstrated in Fig. 
where we explore the full parameter space of a and q. Fig. [^a) shows, for the PP strategy, the 
gradual increase in Ub as the fraction q of interlayer links increases. There is no strong dependency 
on the importance parameter a, only a slight improvement for a = 0.5 which weights the in¬ 
degree and the out-degree equally. Fig. ib -d) illustrate the difference between the PP strategy 
(a) and the remaining other strategies, by just plotting the difference in nb compared to Fig.|^a). 
This difference is always positive, indicating the advantage of the PP strategy. More remarkable. 
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Figure 3: (Color online)(a) Color map encoding the fraction of controlled network nt, on the 
a — q parameter plane for PP strategy, (b-d) Color map encoding the difference between rih for 
PP and Uh for CP,CC,PC strategies. A positive value indicates that for PP is greater than 
the corresponding strategy. The above results are obtained with Nc = 30 driver nodes. 


the difference becomes the largest for moderate values of q and a. Obviously, for q close to 0 or 
q close to 1, the four strategies become indistinguishable. 

So far, we have only investigated the impact of different strategies in linking the two layers. 
But we did not discuss whether it is more benehcial to choose central nodes or peripheral nodes 
from layer 0 as driver nodes. In fact, to obtain Figs. mi we have randomly sampled the set 
of driver nodes from layer 0. Therefore, in Fig. iwe now investigate the impact of central or 
peripheral driver nodes on the controllability of the multilayer network. Specihcally, we compare 
two scenarios: in Fig. i(a) the driver nodes are sampled from the top 10% nodes of high w in 
layer 0, whereas in Fig. i(b) the driver nodes sampled from the top 10% nodes of low w values 
in layer 0. A comparison of the four discussed strategies to connect the two layers and all values 
of g, shows that the difference in nb can be almost neglected (it is less than 1% as shown in 
the supporting hgure Fig.S3(b)). This indicates that, by injecting control signals into peripheral 
nodes, we can control as much of the total network as by injecting control signals into central 
nodes. 
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Figure 4: (Color online) Uh as a function of the fraction of interlayer links. In (a) driver nodes 
are sampled from nodes of high w. In (b) driver nodes are sampled from nodes of low w. The 
results are produced with Nc = 30 driver nodes. 

5 Conclusions 

In this paper, we study the controllability of two-layer directed networks with numerical sim¬ 
ulations. In our model, we distinguish between two different kind of nodes: (i) the nodes that 
should be chosen to connect the two layers, in order to maximize the number of controllable 
nodes in the whole network, N),. (ii) the driver nodes that should be chosen on layer 0 to control 
this subspace. These nodes do not necessarily have to be the same. The number of interlayer 
connections is determined by the parameter q = L/Nq, whereas the number of driver nodes can 
vary as well, Nc < A^o- For ^ given Nc, increasing q usually leads to increasing Nf,, until Nf, 
reaches its saturation. 

At the same time, for given q and Nc, there is a preferred coupling strategy to maximize Nf^, which 
is coupling peripheral nodes in both layers. Assuming that it is less costly to access peripheral 
nodes as compared to central nodes, the PP strategy would also be the most cost-efficient strategy. 
We emphasize that this hnding differs from HU where the CC strategy was preferred. This was 
found for an undirected network, on which a synchronization dynamics was investigated, whereas 
we consider a directed network, on which we assume a linear dynamics. 

As a second important hnding, we have shown that the control of the network can be as effectively 
achieved by choosing Nc from the peripheral nodes as from the central nodes. Referring to the cost 
argument above, choosing peripheral nodes as driver nodes is both effective and cost-efficient. 

The third hnding points to the size of the controllable subspace N},. Here, we show that it is 
sufficient to choose driver nodes from just one layer, to control the whole two-layer network, in 
accordance with earlier hndings jU- Dependent on the fraction of interlayer links, the full control 
can be even achieved with a small number of driver nodes, Nc <C A^o (e-g- A^c = 30 for Nq = 2000 
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and q = 0.6), given that the most efficient PP strategy is used for linking the layers. This again 
emphasizes the importance of peripheral nodes in controlling multilayer networks. 

The above findings were obtained for the particular system of networks used in our simulations. 
However, we also checked the robustness of our findings with the alternative configurations 
presented in the supporting information. These configurations include the case with randomly 
assigned directionality for the interlayer links, and the case of scale free networks with another 
network exponent. This indicates that peripheral nodes may play similarly important role in 
connecting and controlling multilayer networks for systems with other network configurations 
as well. In this sense, our results could be useful for practical applications. As an example let 
us consider the coupling between a power grid and a communication network. A regulator may 
want to achieve better controllability of the full system by accessing a small number of driver 
nodes from the communication network, and this could be achieved by identifying optimal ways 
to couple the two networks using a methodology similar to the one presented in our paper. 
However, a more concrete exploration of the way our approach can be used for real systems is 
beyond the scope of the current manuscript, and is left for future studies. 

Acknowledgements. We gratefully acknowledge helpful discussions with Y.Y Liu from Har¬ 
vard Medical School. A.G. and F.S. acknowledge financial support by the EU-FET project MUL¬ 
TIPLEX 317532. 

Appendix 

We now test the robustness of the our main results by considering two alternative configurations 
of our model: 

a) We build two-layer networks using a power law degree distribution with 7 =2.5. 

b) We connect two layers of networks by interlayer links with directions assigned randomly. 

In Figures SI and S2 we show results obtained with the above alternative configurations using 
networks of size Nq = Ni = 2000. The presented results are averaged over 100 simulations with 
error bars representing the standard deviation. Even though some effects are less pronounced 
(for example, see Fig.SI wrt Fig.2), the results are in-line with our main findings. 
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Figure SI: (Color online) rib as a function of the fraction of interlayer links and the number of 
driver nodes. The top panel shows the characteristics of rib as a function of the fraction of in¬ 
terlayer links q. The bottom panel shows the characteristics of rib as a function of the number 
of driver nodes Nc, for L = 800 interconnecting links. In (a)&:(b) 7 =2.5, and the two network 
layers are connected by bidirectional interconnecting links. In (c)&(d) 7 =2.5, and the two net¬ 
work layers are connected by interconnecting links whose directions were assigned randomly. 
The results are produced with 30 driver nodes and a = 0.5, and are consistent with the results 
shown in Fig. 2. 
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Figure S2: (Color online) rib as a function of the fraction of interlayer links. The top two fig¬ 
ures are obtained with 7 =2.5, and two layers of networks are connected by bidirectional inter¬ 
connecting links. The bottom figures are produced with 7 =2.0, and two layers are connected 
by interconnecting links whose directions were assigned randomly. In (a)&:(c) the driver nodes 
are sampled from nodes of high w, while in (b)&:(d) the driver nodes are sampled from nodes 
of low w. All the results are produced with Nc = 30 driver nodes, and are consistent with the 
results shown in Fig. 4. 



Figure S3: (Color online) rib as a function of the number of driver nodes Nc, for q = 0.4. In¬ 
creasing the number of driver nodes Nc to Nq = 2000 confirms that the PP strategy to link 

peripheral nodes in both layers allows to better control the whole network. The data points are 
obtained over 100 simulations, with error bars representing standard deviation, (b) The ratio 
of average rib with driver nodes of low w over the average rib with driver nodes of high w. This 
sub-figure shows that the difference in rib is marginal and can be safely neglected. The figure is 
produced using the two-layer network configuration that is discussed in the main text. 
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